Sentence boundaries in text and pauses in speech: Correlation or confrontation?

Authors

  • Anton Stepikhov
  • Anastassia Loukina
Abstract

The paper explores the interaction between sentence boundaries marked by annotators in transcriptions of Russian spontaneous speech and actual prosodic boundaries in the signal. The aim of the research is to investigate whether annotators’ prosodic competence allows them to correctly detect sentence boundaries in speech based on textual information only. We found that inter-annotator agreement for each sentence boundary identified in the transcription was affected by both the presence or absence of a pause and pause duration. A mixed linear model showed that the presence or absence of a pause explained 13% of the variance in boundary detection. Pause duration explained only 4% of the variance in inter-annotator agreement, with a moderate correlation of r = 0.21. We argue that the relatively small effect size in this case may be due to the interaction of different pausing strategies typical of reading and spontaneous speech, the ambiguity of sentence boundaries, and individual differences in speech perception.
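As a rough arithmetic check on the two pause-duration figures, the sketch below squares the reported correlation. It assumes the 4% figure can be read like a squared Pearson correlation; the abstract does not say how variance explained was computed within the mixed model, so this is an illustration and not the authors' procedure. The function name `variance_explained` is hypothetical.

```python
def variance_explained(r: float) -> float:
    """Share of variance accounted for by a simple linear relation
    with Pearson correlation r (assumption: r^2 interpretation)."""
    return r ** 2


# Correlation between pause duration and inter-annotator agreement,
# as reported in the abstract.
r_pause_duration = 0.21

share = variance_explained(r_pause_duration)
print(f"r = {r_pause_duration:.2f} -> r^2 = {share:.4f} (about {share:.0%})")
# prints: r = 0.21 -> r^2 = 0.0441 (about 4%)
```

Under that assumption, r = 0.21 corresponds to roughly 4% of variance, which matches the figure quoted for pause duration.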

Similar resources

Modeling of sentence-medial pauses in Bangla readout speech: occurrence and duration

Control of pause occurrence and duration is an important issue for text-to-speech synthesis systems. In text-readout speech, pauses occur unconditionally at sentence boundaries and with high probability at major syntactic boundaries such as clause boundaries, but more or less arbitrarily at minor syntactic boundaries. Pause duration tends to be longer at the end of a longer syntactic unit. A de...

Effects of Pause Insertion on the Intelligibility of Low Quality Speech

The intelligibility of the output of text-to-speech systems today is generally poorer than that of natural human speech. One way of improving the quality of synthetic speech is to insert speech pauses at selected positions in the utterances rather more frequently than the human reader would choose to do. Pause insertion has been reported to improve intelligibility in deaf-speech [1] as well as ...

Examining the Association between T-unit and Pausing Length on the EFL Perception of Listening Comprehension

Listening, which takes up over half of learners’ time and effort (Nunan, 1998), forms a basis for acquiring much of a language. Several factors affect listening comprehension and its perception, such as the speech rate, phonological properties of the text, the quality of the recording, the learners’ anxiety, and listening comprehension strategies (Goh, 2000; Hamouda, 2013). At the Iran Language...

Factors influencing ratios of filled pauses at clause boundaries in Japanese

Speech disfluencies have been studied as clues to human speech production mechanisms. Major constituents are assumed to be principal units of planning and disfluencies are claimed to occur when speakers have some trouble in planning such units. We tested two hypotheses about the probability of disfluencies by examining the ratios of filled pauses (fillers) at sentence and clause boundaries: 1) ...

Factors Affecting the Occurrence and Duration of Sentence-medial Pauses in Japanese Text Reading

Pauses play important roles for the intelligibility and naturalness of speech. Their occurrence and duration in text reading are influenced by syntactic structures of the text as well as by physiological constraints of respiration on the part of the speaker. In contrast to sentence- and paragraph-final pauses, sentence-medial pauses are influenced by a number of factors. Analysis of Japanese news...

Publication date: 2015